ABSTRACT



An apparatus for processing a Standard Generalized Markup
Language (SGML) document uses a SGML conversion form
generation module to generate a SGML conversion form file
that correlates a prototype file having a specific form with
a document type definition. When a plain text document
prepared using the specific form is to be converted to a
SGML document, a SGML document generation module converts
the prepared plain text document to the SGML document by
referring to the SGML conversion form file. When the reverse
conversion is executed, a SGML document read-in module
converts the SGML document back to the plain text document
by referring to the same SGML conversion form file.

11 Claims, 11 Drawing Sheets 




U.S. Patent    Mar. 13, 2001    Sheet 1 of 11    US 6,202,072 B1



FIG. 1

[Block diagram. Blocks: SGML conversion form generation
module 101; SGML document generation module 102; SGML
document read-in module 103; document type definition file
104; prototype file 105; SGML conversion form file 106; SGML
document 107; plain text document 108.]



FIG. 2

[View illustrating the principles of the operations involved
in the preparation of a SGML conversion form. The drawing
text is illegible in this copy.]



FIG. 3

(a) PLAIN TEXT DOCUMENT (SAME FORM AS PROTOTYPE)

    APPLICATION FORM FOR REGISTERING E-MAIL ADDRESS

    NAME          TOKKYO TARO
    DEPARTMENT    DEVELOPMENT
    ADDRESS       taro@ABC

    INPUT BY USER: "TOKKYO TARO", "DEVELOPMENT", "taro@ABC"

    STORE IN SGML FORM

(b) PLAIN TEXT DOCUMENT -> SGML DOCUMENT
    (SGML CONVERSION FORM)

(c) SGML DOCUMENT

    <DOCUMENT FOR APPLICATION>
    <STAFF>
    <NAME> TOKKYO TARO </NAME>
    <DEPARTMENT> DEVELOPMENT </DEPARTMENT>
    <ADDRESS> taro@ABC </ADDRESS>
    </DOCUMENT FOR APPLICATION>

    READ-IN SGML DOCUMENT

(d) SGML DOCUMENT -> PLAIN TEXT DOCUMENT
    (SGML CONVERSION FORM)

(e) PLAIN TEXT DOCUMENT

    APPLICATION FORM FOR REGISTERING E-MAIL ADDRESS

    NAME          TOKKYO TARO
    DEPARTMENT    DEVELOPMENT
    ADDRESS       taro@ABC



FIG. 4

[Block diagram showing the hardware of the apparatus for
processing SGML document. The drawing text is illegible in
this copy.]



FIG. 5

PROCESSING FOR GENERATING SGML FORM
    |
    v
READ-IN PROTOTYPE FILE OF PLAIN TEXT DOCUMENT
    |
    v
S2: CORRELATE EACH TAG OF DOCUMENT TYPE DEFINITION
    TO EACH FIELD
    |
    v
GENERATE SGML CONVERSION FORM
    |
    v
REGISTER SGML CONVERSION FORM FILE
    |
    v
END



FIG. 6

DATE :    /

NAME                                 SEX

BIRTH-DATE

ADDRESS
    POSTAL CODE
    METROPOLIS-DISTRICT        KEN • TO • FU
    CITY-TOWN-VILLAGE          -SHI
    HOUSE NUMBER               -CHOME  -BAN



FIG. 7

[Conversion table 5]

FIELD NAME                    TAG NAME
NAME FIELD                    <NAME>
DATE FIELD                    <DATE>
SEX FIELD                     <SEX>
BIRTH-DATE FIELD              <BIRTH-DATE>
METROPOLIS-DISTRICT FIELD     <METROPOLIS-DISTRICT>
POSTAL CODE FIELD             <POSTAL CODE>
CITY-TOWN-VILLAGE FIELD       <CITY-TOWN-VILLAGE>
HOUSE NUMBER FIELD            <HOUSE NUMBER>
...                           ...



FIG. 8

PROCESSING FOR GENERATING SGML DOCUMENT
    |
    v
S11: GENERATE PLAIN TEXT DOCUMENT USING PROTOTYPE FORM
    |
    v
S12: CONVERT PLAIN TEXT DOCUMENT TO SGML DOCUMENT WITH
     REFERENCE TO SGML CONVERSION FORM
    |
    v
S13: OUTPUT SGML DOCUMENT



FIG. 9

DATE : AUGUST 30, 1996

NAME          TOKKYO TARO            SEX   MALE

BIRTH-DATE    APRIL 29, 1958

ADDRESS
    POSTAL CODE            102-91
    METROPOLIS-DISTRICT    TOKYO  KEN • TO • FU
    CITY-TOWN-VILLAGE      MITAKA-SHI, KAMIRENJAKU
    HOUSE NUMBER           1-23-

[Reference letters a, b, c, and d mark fields of the form.]



FIG. 10

<APPLICATION FORM>
<DATE> AUGUST 30, 1996 </DATE>
<NAME> TOKKYO TARO </NAME>
<SEX> MALE </SEX>
<POSTAL CODE> 102-91 </POSTAL CODE>
<METROPOLIS-DISTRICT> TOKYO </METROPOLIS-DISTRICT>
<CITY-TOWN-VILLAGE> MITAKA-SHI </CITY-TOWN-VILLAGE>
<TOWN-NAME> KAMIRENJAKU </TOWN-NAME>
<HOUSE NUMBER> 1-23-



FIG. 11

PROCESSING FOR READING SGML DOCUMENT
    |
    v
S21: READ OUT SGML DOCUMENT
    |
    v
S22: CONVERT SGML DOCUMENT TO PLAIN TEXT DOCUMENT WITH
     REFERENCE TO SGML CONVERSION FORM
    |
    v
S23: OUTPUT PLAIN TEXT DOCUMENT




METHOD AND APPARATUS FOR 
PROCESSING STANDARD GENERALIZED 
MARKUP LANGUAGE (SGML) AND 
CONVERTING BETWEEN SGML AND 
PLAIN TEXT USING A PROTOTYPE AND 
DOCUMENT TYPE DEFINITION 

FIELD OF THE INVENTION 

The present invention relates to a method and apparatus for
processing a standard generalized markup language document,
which convert a plain text document prepared by a user with
a word processor or similar software to a standard
generalized markup language document (hereafter SGML
document), and to a computer-readable recording medium
storing a program that makes a computer execute the method.

BACKGROUND OF THE INVENTION 
Conventionally, a document processing apparatus such as a
word processor provides a mode for saving data in a SGML
format, so as to provide an interface between the software
and a database. In a SGML document in compliance with the
SGML standard, the document structure is defined according
to the document type definition (DTD).

For this reason, when preparing a SGML document
according to the document structure specified by the
document type definition, it is necessary to add tags
indicating a structural element to each of the input items
in a plain text document. Accordingly, tags are generally
inputted by operating a keyboard.

However, with recent technological development in a
related field, there have been proposed SGML converting
techniques in which software analyzes the syntax of a
previously prepared plain text document and automatically
adds tags indicating a structural element to each of the
syntactic units obtained. Examples of the technique are
disclosed, for instance, in Japanese Patent Laid-Open
Publication No. HEI 5-101054, Japanese Patent Laid-Open
Publication No. HEI 7-44560, and Japanese Patent Laid-Open
Publication No. HEI 8-221399.

In Japanese Patent Laid-Open Publication No. HEI
5-101054, there is disclosed a technology for marking up a
text document according to a pre-specified document type
definition, dividing the marked-up text document into
components, storing the components in a database,
retrieving desired components from the database when a
document is to be prepared, and synthesizing the components
obtained through the retrieval to obtain the desired
document. Also, in Japanese Patent Laid-Open Publication
No. HEI 7-44560, there is disclosed a technology for
obtaining a SGML document from a marked-up text document
prepared according to the SGML regulations, by generating a
logical structure specific to that text document.

Also, in Japanese Patent Laid-Open Publication No.
HEI 8-221399, there is disclosed a technology for obtaining
a SGML document by (1) referring to correspondence
between temporary tags, which indicate a pre-prepared
logical structure of a text document specified by a user,
and a plurality of document type definitions, (2) obtaining
the degree of coincidence between the original text prepared
by the user using the temporary tags and each of the
document type definitions, and (3) having the user select
one of the document type definitions, which are displayed in
order of the degree of coincidence.

In the conventional technology, as disclosed in the
publications, preparing a desired SGML document requires
preparation of samples such as components, marked-up texts,
and document type definitions, as well as processing such as
retrieval of the samples, synthesis thereof, generation of a
logical structure, and determination of the degree of
coincidence thereamong. This processing style is suited to
SGML conversion of a text whose form is not fixed; however,
for text documents having a specific form, such as form
paper, the workability is low. For this reason, improvement
thereof has been expected in recent years.

SUMMARY OF THE INVENTION 

It is an object of the present invention to obtain an
apparatus for processing SGML document that makes it
possible to improve the workability of SGML conversion of a
plain text document having a fixed form, a method for
processing a SGML document for the same, and a
computer-readable recording medium storing a program that
makes a computer execute the method.

The apparatus for processing SGML document according
to the present invention generates a conversion form for
conversion between a prototype document having a specific
form and a document type definition by correlating
structural elements in the prototype document to those in
the document type definition respectively, and registers the
generated conversion form so that it can be referred to when
a plain text document prepared by using the prototype
document is converted to a SGML document. For this reason,
the conversion form corresponding to the specific form need
be generated only once, and since it is not necessary to
prepare samples such as components, markup documents, and
document type definitions every time as in the conventional
technology, it is possible to improve the workability
thereof.

The apparatus for processing SGML document according
to the present invention generates a conversion table for
conversion between a prototype document having a specific
form and a document type definition by correlating fields of
the prototype document to tags of the document type
definition, and registers the generated conversion table so
that it can be referred to when a plain text document
prepared by using the prototype document is converted to a
SGML document having tags. For this reason, the conversion
table corresponding to the specific form need be generated
only once, and since it is not necessary to prepare samples
such as components, markup documents, and document type
definitions every time as in the conventional technology, it
is possible to improve the workability thereof.

The apparatus for processing SGML document according
to the present invention registers in advance a conversion
form in which structural elements in a prototype document
are correlated to those in a document type definition, for
conversion between the prototype document having a specific
form and the document type definition; prepares a plain
text document using the specific form; and converts the
prepared plain text document to a SGML document by referring
to the registered conversion form. Once a conversion form
corresponding to the specific form has been generated, the
only work required thereafter is to prepare a plain text
document in the specific form. For this reason, it is not
necessary to prepare samples such as components, markup
documents, and document type definitions every time, as in
the conventional technology, in order to execute processing
such as retrieval of the samples, synthesis thereof,
generation of a logical structure, and determination of the
degree of coincidence thereamong, which makes it possible to
improve the workability of conversion from a plain text
document with a specific form to a SGML document.

The apparatus for processing SGML document according
to the present invention registers in advance a conversion
table in which fields of a prototype document are correlated
to tags of a document type definition, for conversion
between the prototype document having a specific form and
the document type definition; prepares a plain text document
using the specific form; and converts the prepared plain
text document to a SGML document having tags by referring to
the registered conversion table. Once a conversion table
corresponding to the specific form has been prepared, the
only work required thereafter is to prepare a plain text
document in the specific form. For this reason, it is not
necessary to prepare samples such as components, markup
documents, and document type definitions every time, as in
the conventional technology, in order to execute processing
such as retrieval of the samples, synthesis thereof,
generation of a logical structure, and determination of the
degree of coincidence thereamong, which makes it possible to
improve the workability of conversion from a plain text
document with a specific form to a SGML document.

In the apparatus for processing SGML document according
to the present invention, the specific form, such as a form
with ruled lines, an application form, or form paper, has
fields for inputting data, so that a user simply inputs data
into the fields without having to add tags thereto, which
makes it possible to reduce the load on the user.

The apparatus for processing SGML document according
to the present invention further reconverts the plain text
document from the SGML document by referring to the
registered conversion table, so that it is possible to
realize a two-directional conversion between a plain text
document and a SGML document with higher workability in the
case of documents having a specific form.

The method for processing SGML document according to
the present invention comprises steps for generating a
conversion table for conversion between a prototype document
having a specific form and a document type definition by
correlating the fields of the prototype document with the
tags of the document type definition, and registering the
generated conversion table so that it can be referred to
when a plain text document prepared by using the prototype
document is converted to a SGML document having tags. For
this reason, the conversion table corresponding to the
specific form need be generated only once, and since it is
not necessary to prepare samples such as components, markup
documents, and document type definitions every time as in
the conventional technology, it is possible to improve the
workability thereof.

The method for processing SGML document according to
the present invention comprises steps for registering in
advance a conversion table correlating fields of a prototype
document with tags of a document type definition, for
conversion between the prototype document having a specific
form and the document type definition; preparing a plain
text document using the specific form; converting the
prepared plain text document to a SGML document having tags
by referring to the registered conversion table; and
converting the converted SGML document back to the plain
text document by referring to the registered conversion
table. Once a conversion table corresponding to the specific
form has been generated, the only work required thereafter
is to prepare a plain text document in the specific form.
For this reason, it is not necessary to prepare samples such
as components, markup documents, and document type
definitions every time, as in the conventional technology,
in order to execute processing such as retrieval of the
samples, synthesis thereof, generation of a logical
structure, and determination of the degree of coincidence
thereamong, which makes it possible to improve the
workability of conversion from a plain text document with a
specific form to a SGML document. It is also possible to
realize a two-directional conversion between a plain text
document and a SGML document with higher workability in
preparation of a document having a specific form.

The recording medium according to the present invention
stores therein a program for making a computer execute the
steps described above, whereby the program becomes
machine-readable, and with this feature, the operations in
the steps can be realized by a computer.

Other objects and features of this invention will become
clear from the following description with reference to the
accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram illustrating functional features
of the apparatus for processing SGML document according
to an embodiment of the present invention;

FIG. 2 is a view for illustrating the principles of the
operations involved in the preparation of a SGML conversion
form according to the embodiment;

FIG. 3 is a view for illustrating the principles of the
operations involved in the conversion between a plain text
document and a SGML document according to the embodiment;

FIG. 4 is a block diagram showing the hardware of the
apparatus for processing SGML document according to the
embodiment;

FIG. 5 is a flow chart for illustrating the processes
involved in the generation of a SGML conversion form
according to the embodiment;

FIG. 6 is a view showing an example of a prototype
document having a specific form in the embodiment;

FIG. 7 is a view showing an example of a conversion table
according to the embodiment;

FIG. 8 is a flow chart for illustrating the processes
involved in the generation of a SGML document according
to the embodiment;

FIG. 9 is a view showing an example of a plain text
document in the embodiment;

FIG. 10 is a view showing an example of a SGML
document in the embodiment; and

FIG. 11 is a flow chart for illustrating the processes
involved in reading in of a SGML document according to the
embodiment.

DESCRIPTION OF THE PREFERRED
EMBODIMENTS

Detailed description is made hereinafter for preferred
embodiments of the method and apparatus for processing
SGML, and of a computer-readable recording medium storing a
program that makes a computer execute the method, with
reference to the related drawings.

At first, a description is provided of the basic
principles of the invention. FIG. 1 is a block diagram
illustrating functional features of the apparatus for
processing SGML document according to an embodiment of the
present invention. This apparatus for processing SGML
document comprises, as shown in FIG. 1, a SGML conversion
form generation module 101, a SGML document generation
module 102, and a SGML document read-in module 103.

The SGML conversion form generation module 101 generates
a SGML conversion form file 106 for conversion between a
document type definition file 104 and a prototype file 105,
which is based on a plain text document, by correlating
structural elements of the document type definition file 104
to those of the prototype file 105 respectively. The SGML
document generation module 102 refers to the SGML conversion
form file 106 to generate a SGML document 107 from a plain
text document 108 prepared by a user. The SGML document
read-in module 103 refers to the SGML conversion form file
106 to generate a plain text document 108 from a SGML
document already generated by the SGML document generation
module 102.

Next, a description is made of the principles of preparing
the SGML conversion form file 106. FIG. 2 is a view for
illustrating the principles of the operations involved in the
preparation of a SGML conversion form file according to the
embodiment. Preparation of this SGML conversion form file
106 is one of the jobs performed by the SGML conversion
form generation module 101. As shown in FIG. 2, three types
of information are required for preparation of a SGML
conversion form: a prototype of a plain text document
(prototype file 105), a document type definition (document
type definition file 104), and data correlating these files
with each other.

The three types of information can be obtained from 
interactive operations between a computer and a user, by 
using a user interface, through input means such as a 
keyboard or the like as well as through a data display means 
responding to the input. A SGML conversion form is pre- 
pared on the basis of a relation among those three types of 
information. The generated SGML conversion form realizes 
two-directional conversion between a plain text document 
and a SGML document by correlating the prototype with the 
document type definition. 

The example in FIG. 2 shows, as a prototype of a plain
text document, an application form for registering an e-mail
address (described as an application form hereinafter). This
application form provides fields for name, department, and
address (indicating an e-mail address). A user simply inputs
data in the form of a character string (name, department,
address) into each of the fields to prepare a plain text
document.

In order to generate a SGML conversion form relating to
this application form, it is necessary to show what
structure each of the fields for name, department, and
address has in the document type definition. For instance,
in the case of name, by defining the field in the prototype
(application form) that should correspond to the structure
for the name in the document type definition, it is possible
to obtain data for correlation between the application form
and the document type definition.

Similarly, by correlating a corresponding document
structure in the document type definition with each of the
fields for department and address respectively, it is
possible to obtain data for correlation therebetween.
Finally, by combining the application form, the document
type definition, and the data for correlation therebetween
into one unit, a SGML conversion form is prepared.
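
By way of a non-limiting illustration, the combination of the three types of information into one unit can be sketched in Python as follows. The names `ConversionForm` and `build_conversion_form`, the root element name `APPLICATION`, and the use of simple name lists in place of a parsed prototype file and document type definition are all assumptions made for this sketch and do not appear in the embodiment.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConversionForm:
    """Hypothetical stand-in for the SGML conversion form."""
    root: str            # top-level element taken from the DTD
    field_to_tag: dict   # prototype field name -> DTD element name


def build_conversion_form(prototype_fields, dtd_elements, root, correlation):
    """Combine prototype, DTD, and correlation data into one unit."""
    if root not in dtd_elements:
        raise ValueError(f"root element {root!r} is not in the DTD")
    for field in prototype_fields:
        if field not in correlation:
            raise ValueError(f"field {field!r} has no correlated element")
    for field, tag in correlation.items():
        if field not in prototype_fields:
            raise ValueError(f"unknown prototype field {field!r}")
        if tag not in dtd_elements:
            raise ValueError(f"element {tag!r} is not in the DTD")
    # Each element may be used by one field only, so that the same
    # form can also drive the reverse conversion unambiguously.
    if len(set(correlation.values())) != len(correlation):
        raise ValueError("two fields are correlated to the same element")
    return ConversionForm(root=root, field_to_tag=dict(correlation))


form = build_conversion_form(
    prototype_fields=["name", "department", "address"],
    dtd_elements={"APPLICATION", "NAME", "DEPARTMENT", "ADDRESS"},
    root="APPLICATION",
    correlation={
        "name": "NAME",
        "department": "DEPARTMENT",
        "address": "ADDRESS",
    },
)
print(form.field_to_tag["address"])  # ADDRESS
```

The check that no two fields share an element is a design choice of the sketch: it keeps the correlation one-to-one, which is what allows a single conversion form to serve both conversion directions.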

The next description concerns the principles of conversion
between a plain text document and a SGML document.
FIG. 3 is a view for illustrating the principles of the
operations involved in the conversion between a plain text
document and a SGML document according to the embodiment.
The conversion from a plain text document 108 having a
prototype form to a SGML document 107 is one of the jobs
performed by the SGML document generation module 102. On
the other hand, the reverse conversion from the SGML
document 107 to the plain text document 108 is one of the
jobs performed by the SGML document read-in module 103.

The SGML conversion form file 106 is required for both
the conversion and the reverse conversion, that is, for the
conversion of a plain text document having a specific form
to a SGML document, and for the conversion of the SGML
document to a plain text document having a specific form, as
in the case of the application form shown in FIG. 2.

With reference to FIG. 3, description is made for a case
where SGML conversion is executed by actually inputting
concrete data into the application form shown in FIG. 2.
The application form displayed on a display has, as shown
in FIG. 2, fields for inputting name, department, and
address. For instance, when a user inputs, through a user
interface, "TOKKYO Taro" into the name field, "Development"
into the department field, and "taro@ABC" into the address
field, each item of data is displayed in the corresponding
field (refer to FIG. 3(a)).

When data input is over and the user gives an instruction
to convert the data to a SGML document, the application form
is converted to a SGML document. In the process, the SGML
conversion form file 106 generated beforehand is referred
to, and the position of the document structure corresponding
to each of the fields in the application form is found in
the document type definition. Then the data inputted into
each of the fields is set in the position for the
corresponding document structure (FIG. 3(b)).

More specifically, because the SGML document is one
having tags, the position for the document structure
corresponding to each of the fields is the position at which
the tag corresponding to the field is provided. Accordingly,
as shown in FIG. 3(c), the data "TOKKYO Taro" inputted in
the name field is set between the <Name> and </Name> tags,
"Development" inputted in the department field is set
between the <Department> and </Department> tags, and
"taro@ABC" inputted in the address field is set between the
<Address> and </Address> tags. As a result, a SGML document
is generated.
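
As a minimal sketch of the forward conversion described above, and not a definitive implementation, the following Python function wraps the data of each field in the start and end tags correlated to it. The function name `to_sgml`, the root element name `APPLICATION`, and the dictionary representation of the conversion form are assumptions of the sketch; it also assumes that the field data contains no markup characters such as `<` or `&`.

```python
def to_sgml(root, field_to_tag, field_values):
    """Set each field's data between the tags correlated to that field."""
    lines = [f"<{root}>"]
    for field, tag in field_to_tag.items():
        data = field_values.get(field, "")  # an empty field yields an empty element
        lines.append(f"<{tag}>{data}</{tag}>")
    lines.append(f"</{root}>")
    return "\n".join(lines)


# Hypothetical conversion form for the application form of FIG. 2.
field_to_tag = {
    "name": "Name",
    "department": "Department",
    "address": "Address",
}
field_values = {
    "name": "TOKKYO Taro",
    "department": "Development",
    "address": "taro@ABC",
}

sgml = to_sgml("APPLICATION", field_to_tag, field_values)
print(sgml)
# <APPLICATION>
# <Name>TOKKYO Taro</Name>
# <Department>Development</Department>
# <Address>taro@ABC</Address>
# </APPLICATION>
```

The user supplies only the field data; the tag names come entirely from the registered correlation, which is the point of the arrangement.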

The SGML document obtained by the conversion shown in
FIG. 3(c) can also be reconverted to the original plain text
document. In this case, the SGML conversion form file 106 is
referred to in the reverse order to that described above.
Namely, the generated SGML conversion form file 106 is
referred to, and at first, the position of the field
corresponding to each document structure in the SGML
document, namely to each tag, is found in the prototype
file. Then the data set at each tag is set in the
corresponding field (FIG. 3(d)).

More specifically, because the plain text document is one
having specific fields, the position on the prototype
corresponding to each tag is the position of a field.
Accordingly, as shown in FIG. 3(e), the data "TOKKYO
Taro" set between the <Name> and </Name> tags in the SGML
document is set in the name field, "Development" set between
the <Department> and </Department> tags is set in the
department field, and "taro@ABC" set between the <Address>
and </Address> tags is set in the address field. As a
result, the SGML document is reconverted to the plain text
document.
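
The reverse conversion can be sketched in the same hedged manner. The function name `from_sgml` and the dictionary form of the conversion form are assumptions of this sketch. A regular expression is used here only because the example document is flat and fully tagged; a general SGML document (with omitted tags, nesting, or entities) would require a parser driven by the document type definition.

```python
import re


def from_sgml(sgml_text, field_to_tag):
    """Return the data found at each tag, keyed by the correlated field."""
    values = {}
    for field, tag in field_to_tag.items():
        name = re.escape(tag)
        match = re.search(rf"<{name}>(.*?)</{name}>", sgml_text, flags=re.DOTALL)
        # A tag that is absent from the document leaves its field empty.
        values[field] = match.group(1).strip() if match else ""
    return values


# Hypothetical conversion form, the same one used for the forward direction.
field_to_tag = {
    "name": "Name",
    "department": "Department",
    "address": "Address",
}
document = (
    "<APPLICATION>\n"
    "<Name> TOKKYO Taro </Name>\n"
    "<Department> Development </Department>\n"
    "<Address> taro@ABC </Address>\n"
    "</APPLICATION>"
)

fields = from_sgml(document, field_to_tag)
for field, data in fields.items():
    print(f"{field.upper():<12}{data}")
```

Because the same correlation is read in the opposite direction, no second mapping has to be prepared for the reverse conversion.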

Herein, an example of the hardware configuration of the
apparatus for processing SGML document is explained.
FIG. 4 is a block diagram showing the hardware of the
apparatus for processing SGML document according to the
embodiment. The SGML document processing apparatus
comprises, as shown in FIG. 4, units such as a CPU 1, a
program memory 2 connected to said CPU 1 by an internal
bus 9, a RAM 3, a memory section 4 for document
management, a conversion table 5, a document memory
section 6, an operating section 7, and a display section 8.
It should be noted that the internal bus 9 is a transmission
line for control signals, data, address signals, and the
like.

The CPU 1 is a unit for controlling operations of the
entire apparatus according to various programs stored in the
program memory 2. Namely, the CPU 1 controls the execution
of the various programs stored in the program memory 2,
read/write of data from/to the RAM 3, read/write of data
from/to the memory section 4 for document management or the
document memory section 6, receipt of key entry from the
operating section 7, display on the display section 8, and
the like.

The program memory 2 stores an operating system OS 2A, an
application program AP 2B, and various types of parameters
required for the operations of the CPU 1 and the like.
Included in the application program AP 2B are programs for
realizing each function of the SGML conversion form
generation module 101, the SGML document generation module
102, and the SGML document read-in module 103. The RAM 3 is
used as a work area when the CPU 1 is in operation.

The memory section 4 for document management stores
therein the document type definition file 104 and the
prototype file 105. In this memory section 4 for document
management, the contents registered in the document type
definition file 104 and the prototype file 105 can be
changed or added to. The conversion table 5 holds the SGML
conversion form file 106 converted into table form and
registered therein. Not only the SGML conversion form file
106 but also a plurality of types of conversion form can be
registered in this conversion table 5. The document memory
section 6 stores therein the SGML document 107 and the plain
text document 108.
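
Registration of a plurality of conversion forms in one table can be pictured, purely as an assumption-laden sketch, as a lookup keyed by the name of the specific form. The class name `ConversionTable`, its `register` and `lookup` methods, and the form names used as keys are hypothetical and are not taken from the embodiment.

```python
class ConversionTable:
    """Hypothetical in-memory stand-in for the conversion table 5."""

    def __init__(self):
        self._forms = {}  # form name -> {field name: tag name}

    def register(self, form_name, field_to_tag):
        """Register (or replace) the conversion form for one specific form."""
        self._forms[form_name] = dict(field_to_tag)

    def lookup(self, form_name):
        """Return the conversion form registered under form_name."""
        try:
            return self._forms[form_name]
        except KeyError:
            raise KeyError(
                f"no conversion form registered for {form_name!r}"
            ) from None


table = ConversionTable()
table.register(
    "e-mail registration",
    {"name": "NAME", "department": "DEPARTMENT", "address": "ADDRESS"},
)
table.register(
    "application form",
    {"name": "NAME", "date": "DATE", "sex": "SEX"},
)
print(table.lookup("application form")["date"])  # DATE
```

Both conversion directions would consult the same entry, so a new specific form is supported by adding one entry rather than by changing either conversion routine.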

The operating section 7 is a unit that plays a rote of a user 
interface and has keys and switches for instructing, through 
key entry, operations for generation of a SGML conversion 
form, generation of a SGML document, and read-in of a 50 
SGML document or the like. The display section 8 displays 
data when each of the functions of the SGML document 
conversion form generation module 101, SGML document 
generation module 102, and SGML document read-in mod- 
ule 103 is performed according to the control by the CPU 1. 55 

The next description concerns operations involved in the
generation of a SGML conversion form by using the
hardware configuration shown in FIG. 4. FIG. 5 is a flowchart
for explaining processes involved in the generation of
a SGML conversion form according to the embodiment,
FIG. 6 is a view showing an example of a prototype
document having a specific form in the embodiment, and
FIG. 7 is a view showing an example of a conversion table
according to the embodiment.

The description assumes hereinafter a prototype form for
an application form shown in FIG. 6 as one example of a
specific form used for ruled lines, an application form, or
form paper or the like. At first, an application form which is
a prototype form for a plain text document is prepared by
inputting characters and ruled lines with the operating
section 7 while referring to the display on the display
section 8 (step S1). This application form comprises a
plurality of fields from a to h. These fields are allocated
along the ruled lines. In the figure, designated at the
reference letter a is a name field, at b is a date field, at c
is a sex field, at d is a birth-date field, at e is a
metropolis-district field, at f is a postal code field, at g is
a city-town-village field, and at h is a house number field
respectively.

When preparation of the prototype form of the application
form is finished in step S1, the prototype form thereof is
registered in the memory section 4 for document management
as a prototype file 105. Further, each of the fields a to
h is correlated to each tag of the document type definition
along the prototype form on the display through key operations
by the operating section 7. Document type definitions
and data correlated thereto are prepared through the operation
for correlation (step S2).

As a result, as shown in FIG. 7, the SGML conversion
form is prepared in which each of the tags is correlated to each
of the fields, such as a tag <Name> to the name field, a tag
<Date> to the date field, a tag <Sex> to the sex field, a tag
<Birth-date> to the birth-date field, a tag <Metropolis-district>
to the metropolis-district field, a tag <Postal code> to the
postal code field, a tag <City-town-village> to the
city-town-village field, and a tag <House number> to the house
number field respectively (step S3). Further, this SGML conversion
form is registered in the conversion table 5 as a file (step S4).
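The correlation of steps S2 and S3 can be sketched as building a
mapping from field letters to tag names. This is an illustration
under stated assumptions, not the embodiment's implementation:
the function name generate_conversion_form, the dictionary
layout, and the check against a list of declared element names
are all hypothetical. Field letters and tag names are reproduced
as the text gives them for FIG. 6 and FIG. 7.

```python
# Field letters and labels as described for the prototype form of FIG. 6.
PROTOTYPE_FIELDS = {
    "a": "name", "b": "date", "c": "sex", "d": "birth-date",
    "e": "metropolis-district", "f": "postal code",
    "g": "city-town-village", "h": "house number",
}

# Tag names as listed for FIG. 7 (assumed to be declared in the DTD).
DTD_ELEMENTS = [
    "Name", "Date", "Sex", "Birth-date", "Metropolis-district",
    "Postal code", "City-town-village", "House number",
]


def generate_conversion_form(fields, correlations, dtd_elements):
    """Correlate every field with one tag, rejecting undeclared tags."""
    form = {}
    for field_id in fields:
        tag = correlations[field_id]  # the user's choice for this field
        if tag not in dtd_elements:
            raise ValueError(f"{tag!r} is not declared in the DTD")
        form[field_id] = tag
    return form


# Stand-in for the key operations on the operating section 7:
# fields a to h are paired with the tags in the order listed above.
user_correlations = dict(zip(PROTOTYPE_FIELDS, DTD_ELEMENTS))
conversion_form = generate_conversion_form(
    PROTOTYPE_FIELDS, user_correlations, DTD_ELEMENTS
)
```

The resulting conversion_form is what step S4 would then register
in the conversion table 5.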

Then, description is made for operations involved in the 
generation of a SGML document. FIG. 8 is a flow chart 
showing a processing sequence in generation of a SGML 
document according to the embodiment, FIG. 9 is a view 
showing an example of a plain text document in the 
embodiment, and FIG. 10 is a view showing an example of 
a SGML document in the embodiment. 

Since a SGML conversion form is used in the generation 
of the SGML document, it is necessary that the SGML 
conversion form is registered in the conversion table 5 
beforehand. Accordingly, a desired prototype form is first 
read out from the prototype file 105 registered in the 
memory section 4 for document management, and then the 
plain text document is prepared. 

For example, in a case where a prototype form for an
application shown in FIG. 6 is read out, the prototype form
is displayed on the display screen of the display section 8,
so that a user may input desired data in each of the fields by
operating the operating section 7. The plain text document
can easily be prepared through the operations (step S11).
When each data is inputted in each of the fields as described
below, the plain text document is prepared as shown in FIG.
9, for instance, "TOKKYO Taro" in the name field a,
"August 30th, 1996" in the date field b, "Male" in the sex
field c, "April 29th, 1958" in the birth-date field d,
"Tokyo-to" in the metropolis-district field e, "102-91" in the
postal code field f, "Mitaka-shi" in the city-town-village
field g, and "1-23 ..." in the house number field h.

Then, by referring to the conversion form relating to this
application form, namely the SGML conversion form file 106
(the format in FIG. 7) registered in the conversion table 5,
the plain text document shown in FIG. 9 is converted to the
SGML document shown in FIG. 10. Namely, according to the
SGML conversion form file 106, "TOKKYO Taro" in the
name field a is set to a space between tags <Name> and
</Name>, then "August 30th, 1996" in the date field b to a
space between tags <Date> and </Date>, "Male" in the sex
field c to a space between tags <Sex> and </Sex>, "April
29th, 1958" in the birth-date field d to a space between tags
<Birth-date> and </Birth-date>, "Tokyo-to" in the
metropolis-district field e to a space between tags
<Metropolis-district> and </Metropolis-district>, "102-91"
in the postal code field f to a space between tags <Postal
code> and </Postal code>, "Mitaka-shi" in the
city-town-village field g to a space between tags
<City-town-village> and </City-town-village>, and "1-23 ..."
in the house number field h to a space between tags
<House number> and </House number> respectively.
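The forward conversion described above amounts to wrapping each
field value in the start tag and end tag that the conversion form
assigns to that field. The sketch below assumes the conversion
form is a dictionary from field letters to tag names and that the
output places one element per line; the function name to_sgml
and both of those choices are assumptions made for illustration.
It does not escape markup characters inside field values.

```python
# Conversion form as described for FIG. 7: field letter -> tag name.
CONVERSION_FORM = {
    "a": "Name", "b": "Date", "c": "Sex", "d": "Birth-date",
    "e": "Metropolis-district", "f": "Postal code",
    "g": "City-town-village", "h": "House number",
}

# Field values from the FIG. 9 example. The house number is elided
# in the text and is kept elided here.
plain_text_fields = {
    "a": "TOKKYO Taro", "b": "August 30th, 1996", "c": "Male",
    "d": "April 29th, 1958", "e": "Tokyo-to", "f": "102-91",
    "g": "Mitaka-shi", "h": "1-23 ...",
}


def to_sgml(fields, conversion_form):
    """Set each field value between the tags correlated to its field."""
    lines = []
    for field_id, tag in conversion_form.items():
        lines.append(f"<{tag}>{fields[field_id]}</{tag}>")
    return "\n".join(lines)


sgml_document = to_sgml(plain_text_fields, CONVERSION_FORM)
```

Because the tags come only from the conversion form, the user
never types a tag, which is the point the description makes.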

The SGML document generated as described above is
registered in (outputted to) the document memory section 6
together with the corresponding plain text document, and in
addition, the document can be displayed on the display
section 8 according to the instruction from the user (step
S13).

Further, description is made for operations involved in the
reading of a SGML document. FIG. 11 is a flow chart for
illustrating the processes involved in the reading of a SGML
document according to the embodiment. This operation for
reading a SGML document is also a processing for converting
a SGML document in reverse to the plain text
document. Namely, the user accesses the document memory
section 6 by operating the operating section 7 to select a
desired SGML document, and then the SGML document is read
out (step S21). With this operation, the SGML document
displayed on the display section 8 is a SGML document in which
tags are arranged on both sides of each data item as shown in
FIG. 10.

In a case where the user selects the SGML document in
FIG. 10 and instructs the reverse conversion thereof by
operating the operating section 7, by referring to the
conversion form relating to the SGML document, namely the
SGML conversion form file 106 (the format in FIG. 7)
having already been registered in the conversion table 5, the
reverse conversion from the SGML document shown in FIG.
10 to the plain text document shown in FIG. 9 is executed
(step S22). The specific way of conversion is supposed to be
clear from the principles as described above, so that
description thereof is omitted herein. As a result, the plain
text document obtained by the reverse conversion is displayed on
the display section 8 (step S23).
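Since the description omits the specific way of reverse
conversion, the following is only one possible reading of it:
for each field in the conversion form, the text found between
the correlated start tag and end tag is returned to that field.
The function name from_sgml, the use of regular expressions, and
the shortened three-element input are assumptions made for
illustration.

```python
import re

# Conversion form as described for FIG. 7: field letter -> tag name.
CONVERSION_FORM = {
    "a": "Name", "b": "Date", "c": "Sex", "d": "Birth-date",
    "e": "Metropolis-district", "f": "Postal code",
    "g": "City-town-village", "h": "House number",
}


def from_sgml(sgml_text, conversion_form):
    """Return field values recovered from the tagged text."""
    fields = {}
    for field_id, tag in conversion_form.items():
        name = re.escape(tag)  # tag names may contain '-' or spaces
        match = re.search(rf"<{name}>(.*?)</{name}>", sgml_text, re.DOTALL)
        if match:  # fields whose tags are missing are simply skipped
            fields[field_id] = match.group(1)
    return fields


# Shortened input: only the first three elements of the FIG. 10 example.
sgml_document = (
    "<Name>TOKKYO Taro</Name>\n"
    "<Date>August 30th, 1996</Date>\n"
    "<Sex>Male</Sex>"
)
recovered = from_sgml(sgml_document, CONVERSION_FORM)
```

The same conversion form serves both directions, which is what
makes the two-directional conversion possible with a single
registered entry.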

As described above, with this embodiment, the apparatus 
for processing SGML document generates a conversion 
table 5 for conversion between a prototype document having 
a specific form and document type definition by correlating 
fields of the prototype document to tags of the document 
type definition, and registers the generated conversion table 
5 so that it can be referred to when converting a plain text 
document prepared by using the prototype document to a 
SGML document having tags. For this reason a conversion 
table 5 corresponding to the specific form may be generated 
only once, and since it is not required to prepare samples 
such as components, markup documents, and document type 
definitions every time as in the case of the conventional
technology, it is possible to improve the workability thereof.

The apparatus for processing SGML document previously
registers a conversion table 5 correlating fields of a
prototype document to tags of document type definition for
conversion between the prototype document having a specific
form and the document type definition, prepares a plain
text document using the specific form, and converts the
prepared plain text document to a SGML document having
tags by referring to the registered conversion table 5, so that,
if a conversion table 5 corresponding to the specific form is
prepared once, the only work required thereafter is to
prepare a plain text document in a specific form. For this
reason it is not required that samples such as components,
markup documents, and document type definitions are prepared
every time as in the case of the conventional
technology for execution of processing such as retrieval of
the samples, synthesis thereof, generation of a logical
structure, and determination of the degree of coincidence
thereamong or the like, which makes it possible to improve
the workability for conversion from a plain text document
with a specific form to a SGML document.

In the apparatus for processing SGML document, a specific
form has fields such as ruled lines, an application form,
or form paper or the like, for inputting the data so that a user
just inputs data into the fields without bothering to add tags
thereto, which makes it possible to reduce the load on the
user.

The SGML document processing apparatus converts in
reverse the converted SGML document to the plain text
document by referring to the registered conversion table 5,
so that it is possible to realize two-directional conversion
between a plain text document and a SGML document with
higher workability in preparation of a document using a
specific form.

As described above, the apparatus for processing SGML
document according to the present invention generates a
conversion form for conversion between a prototype document
having a specific form and a document type definition
by correlating structural elements in the prototype document
to those in the document type definition respectively, and
registers the generated conversion form so that it can be
referred to when a plain text document prepared
by using the prototype document is converted to a SGML
document. For this reason a conversion form corresponding
to the specific form may be generated only once, and since
it is not required to prepare samples such as components,
markup documents, and document type definitions every
time as in the case of the conventional technology, it is
possible to improve the workability thereof.

The apparatus for processing SGML document according
to the present invention generates a conversion table for
conversion between a prototype document having a specific
form and document type definition by correlating fields of
the prototype document to tags of the document type
definition, and registers the generated conversion table so
that it can be referred to when converting a plain text
document prepared by using the prototype document to a
SGML document having tags. For this reason a conversion
table corresponding to the specific form may be generated
only once, and since it is not required to prepare samples
such as components, markup documents, and document type
definitions every time as in the case of the conventional
technology, it is possible to improve the workability thereof.

The apparatus for processing SGML document according
to the present invention previously registers a conversion
form in which structural elements in a prototype document
are correlated to those in document type definition for
conversion between the prototype document having a specific
form and the document type definition, prepares a plain
text document using the specific form, and converts the prepared
plain text document to a SGML document by referring to the
registered conversion form, so that, if a conversion form
corresponding to the specific form is generated once, the
only work required thereafter is to prepare a plain
text document in the specific form. For this reason, it is not
required that samples such as components, markup
documents, and document type definitions are prepared every
time as in the case of the conventional technology for
execution of processing such as retrieval of the samples,
synthesis thereof, generation of a logical structure, and
determination of the degree of coincidence thereamong or
the like, which makes it possible to improve the workability
for conversion from a plain text document with a specific
form to a SGML document.

The apparatus for processing SGML document according 
to the present invention, previously registers a conversion 
table in which fields of a prototype document are correlated 
to tags of document type definition for conversion between 
the prototype document having a specific form and the 
document type definition, prepares a plain text document 
using the specific form, and converts the prepared plain text 
document to a SGML document having tags by referring to 
the registered conversion table, so that, if a conversion table 
corresponding to the specific form is prepared once, the only
work required thereafter is to prepare a plain
text document in a specific form. For this reason it is not 
required that samples such as components, markup 
documents, and document type definition are prepared every 
time like in the case of the conventional technology for 
execution of processing such as retrieval of the samples, 
synthesis thereof, generation of a logical structure, and 
determination of the degree of coincidence thereamong or 
the like, which makes it possible to improve the workability 
for conversion from a plain text document with a specific 
form to a SGML document. 

In the apparatus for processing SGML document accord- 
ing to the present invention, a specific form has fields such 
as ruled lines, an application form, or form paper or the like, 
for inputting the data so that a user just inputs data into the 
fields without bothering to add tags thereto, which makes it 
possible to reduce the load on the user. 

The apparatus for processing SGML document according 
to the present invention, further reconverts the plain text 
document from the SGML document by referring to the 
registered conversion table, so that it is possible to realize a 
two-directional conversion between a plain text document 
and a SGML document with higher workability in case of 
documents having a specific form. 

The method for processing SGML document according to 
the present invention comprises steps for generating a con- 
version table for conversion between a prototype document 
having a specific form and document type definition by 
correlating the fields of the prototype document with the tags
of the document type definition, and registering therein the 
generated conversion table so that it can be referred to when 
a plain text document prepared by using the prototype 
document is converted to a SGML document having tags. 
For this reason a conversion table corresponding to the 
specific form may be generated only once, and since it is not 
required to prepare samples such as components, markup 
documents, and document type definitions every time like in 
the case of the conventional technology, it is possible to 
improve the workability thereof. 

The method for processing SGML document according to
the present invention comprises steps for previously
registering a conversion table correlating fields of a prototype
document with tags of document type definition for conversion
between a prototype document having a specific form
and the document type definition, preparing a plain text
document using the specific form, converting the prepared
plain text document to a SGML document having tags by
referring to the registered conversion table, and converting
in reverse the converted SGML document to the plain text
document by referring to the registered conversion table, so
that, if a conversion table corresponding to the specific form
is generated once, the only work required thereafter is
to prepare a plain text document in a specific form. For this
reason, it is not required to prepare samples such as
components, markup documents, and document type definitions
every time as in the case of the
conventional technology for execution of processing such as
retrieval of the samples, synthesis thereof, generation of a
logical structure, and determination of the degree of
coincidence thereamong or the like, which makes it possible to
improve the workability for conversion from a plain text
document with a specific form to a SGML document, and it
is also possible to realize a two-directional conversion
between a plain text document and a SGML document with
higher workability in preparation of a document having a
specific form.

The recording medium according to the present invention
stores therein a program for making a computer execute the
steps described above, whereby the program becomes
machine-readable, and with this feature, operations in these
steps can be realized with a computer.

This application is based on Japanese patent application
No. HEI 9-118257 filed in the Japanese Patent Office on
May 8, 1997, the entire contents of which are hereby
incorporated by reference.

Although the invention has been described with respect to
a specific embodiment for a complete and clear disclosure,
the appended claims are not to be thus limited but are to be
construed as embodying all modifications and alternative
constructions that may occur to one skilled in the art which
fairly fall within the basic teaching herein set forth.

What is claimed is:

1. An apparatus for processing SGML document comprising:

a conversion form generation unit for generating a
conversion form for conversion between a prototype document
and document type definition by correlating structural
elements in said prototype document to those in
said document type definition respectively; and

a registration unit for registering the conversion form
generated by said conversion form generation unit so
that it can be referred to when a plain text document
prepared by using said prototype document is converted
to a SGML document.

2. An apparatus for processing SGML document comprising:

a conversion table generation unit for generating a
conversion table for conversion between a prototype document
and document type definition by correlating fields
of said prototype document having a specific form with
tags of said document type definition; and

a registration unit for registering the conversion table
generated by said conversion table generation unit so
that it can be referred to when a plain text document
prepared by using said prototype document is converted
to a SGML document having tags.

3. An apparatus for processing SGML document comprising:

a registration unit for previously registering a conversion
form correlating structural elements in a prototype
document to those in document type definition for
conversion between said prototype document having a
specific form and said document type definition;
a preparation unit for preparing a plain text document
using the specific form; and

a conversion unit for converting the plain text document
prepared by said preparation unit to a SGML document
by referring to the conversion form registered in said
registration unit.

4. An apparatus for processing SGML document comprising:

a registration unit for previously registering a conversion
table correlating fields of a prototype document with
tags of document type definition for conversion
between said prototype document having a specific
form and said document type definition;

a preparation unit for preparing a plain text document
using the specific form; and

a conversion unit for converting the plain text document
prepared by said preparation unit to a SGML document
having tags by referring to the conversion table
registered in said registration unit.

5. An apparatus for processing SGML document according
to claim 2; wherein said specific form has fields such as
ruled lines, application form, or form paper or the like for
inputting the data.

6. An apparatus for processing SGML document according
to claim 4; wherein said specific form has fields such as
ruled lines, application form, or form paper or the like for
inputting the data.

7. An apparatus for processing SGML document according
to claim 4 further comprising:

a reverse conversion unit for reconverting the SGML
document converted by said conversion unit to the
plain text document by referring to the conversion table
registered by said registration unit.

8. A method for processing SGML document comprising:

a conversion table generation step for generating a
conversion table for conversion between a prototype document
and document type definition by correlating fields
of said prototype document having a specific form to
tags of said document type definition; and

a registration step of registering therein the conversion
table generated in said conversion table generation step
so that it can be referred to when a plain text document
prepared by using said prototype document is converted
to a SGML document having tags.

9. A method for processing SGML document comprising:

a preparation step of preparing a plain text document
using a specific form;

a conversion step of converting the plain text document
prepared in said preparation step to a SGML document
having tags by referring to the conversion table for
conversion between a prototype document having a
specific form and document type definition correlating
fields of said prototype document with tags of said
document type definition; and

a reverse conversion step for reconverting the SGML
document converted in said conversion step to the plain
text document by referring to said conversion table.

10. A computer-readable recording medium storing
therein a program making a computer execute said method
according to a method for processing SGML document
comprising a conversion table generation step for generating
a conversion table for conversion between a prototype
document and document type definition by correlating fields
of said prototype document having a specific form to tags of
said document type definition, and a registration step of
registering therein the conversion table generated in said
conversion table generation step so that it can be referred to
when a plain text document prepared by using said prototype
document is converted to a SGML document having tags.

11. A computer-readable recording medium storing
therein a program making a computer execute said method
according to a method for processing SGML document
comprising a preparation step of preparing a plain text
document using a specific form, a conversion step of
converting the plain text document prepared in said preparation
step to a SGML document having tags by referring to the
conversion table for conversion between a prototype document
having a specific form and document type definition
correlating fields of said prototype document with tags of
said document type definition, and a reverse conversion step
for reconverting the SGML document converted in said
conversion step to the plain text document by referring to
said conversion table.

* * * * *